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The conceptual understanding presents a key component of science expertise, and thus the most desirable 
outcome of science education process. However, reports in the literature describe the opposite reality. There is an 
evidence that students are prone to use algorithms, to memorize rules, definitions and procedural steps, that is, to 
use efficient heuristics that allow them to skip logical reasoning for reaching an immediate goal (Dori & Hameiri, 
2003; Hammer, 1994; Nyachwaya, Warfa, Roehrig, & Schneider, 2014). The questions arise: 

(i) How the assessment tools themselves contribute to this phenomenon? 
(ii) Can we encourage the development of logical reasoning among students by applying various assess- 
ment tools? 

The viva voce assessment is probably the oldest type of knowledge examination. Due to its superior character- 
istics (development of communication skills, development of critical thinking, the possibility of diagnosing scientific 
reasoning or misconceptions, profound analysis of student knowledge etc.), this form of examination remained 
the traditional teaching practice for hundreds of years (Huxham, Campbell, & Westwood, 2012). Yet, contemporary 
issues such as the continued expansion of the curricula at all levels of education, large lecture at tertiary level and 
many other objective reasons, have made written exams more widespread in the teaching practice. Unfortunately, 
this has made many students learning for tests, and many teachers teaching to the test, thus ultimately turning 
teaching practice into students’ preparation to test well. 

The simplest written exams, that were most frequently used, consisted of standard multiple-choice questions 
(Aronson & Krause, 1982). This format is likely to be still in the wider use among teachers, despite some objective 
shortcomings such as the high possibility of guessing the correct answers, assessment of knowledge at the level 
of reproduction, lack of creative thinking etc. On the other hand, the academic community offers a wide range of 
novel approaches in the field of students’ assessment, which are far less represented in teaching practice, but which 
seems to require the greater engagement of students, thus leading to the optimization of the learning process. 

As the author of this Editorial, reflection on this topic from the perspective of personal research experience, 
seems appropriate. First, one should look at the multi-tier tests that were created from the real need to eliminate 
the shortcomings of the common multiple-choice questions. From the pioneering work of Treagust (1986), and 
later on in the fields of physics (Caleon & Subramaniam, 2010; Chu, Treagust, & Chandrasegaran, 2009), chemistry 
(Costu, Ayas, Niaz, Unal, & Calik, 2007, Milenkovié, Hrin, Segedinac, & Horvat, 2016; Yan & Subramaniam, 2018), 
astronomy (Kanli, 2014), biology (Arslan, Cigdemoglu, & Moseley, 2012; Kilig & Saglam, 2009), researchers have 
continuously reported on the benefits of applying two-tier, three-tier and four-tier tests. The basic advantage over 
common multiple-choice tests is reflected in the fact that multi-tier tests do not only examine the phenomenon 
but also probe the reasoning behind it, thus requiring greater involvement of students. 

Further, in the context of the recently described forms of assessment, systemics should be mentioned as well. 
This form of assessment was introduced by Fahmy and Lagowski (2002) to promote meaningful learning. These 
authors presented systemic questions — closed, interacting conceptual systems, which require students to correlate 
the existing concepts and discover new relations among them. Further development of these tasks by different 
authors (Hrin, Fahmy, Segedinac, & Milenkovi¢, 2016; Hrin, Milenkovic, & Segedinac, 2018; Vachliotis, Salta, Vasiliou, 
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&Tzougraki, 2011), led to the creation of very effective tools that enable development of higher-order thinking skills. 

Integral parts of the curriculum of every physical science subject are problem solving tasks. However, the 
majority of problem-solving tasks encountered at primary, secondary and even tertiary level of education require 
lower-order cognitive skills to reach a solution. It means that such tasks can be solved successfully, using appropriate 
algorithms without proper conceptual understanding. To solve this problem, researchers (Overton & Potter, 2008; 
Overton, Potter, & Leng, 2013) have developed open-ended questions based on real-life context, correct solution 
of which is not unique, i.e. there is more than one acceptable answer. Such tasks require higher-order skills and 
are highly desirable in teaching practice. 

The above-mentioned examples are just a few of the current forms of assessment that arose as a result of 
the science education research, while there are far more positive examples in the literature. Nonetheless, based 
on the previous, it is clear that good assessment practices play a significant role in fostering effective learning and 
therefore their application in teaching practice is extremely important. The prime question that arises here is how 
familiar the teachers are with these modern forms of assessment? In different countries, the situation varies but 
generally holds the opinion that teachers do not receive much formal training in assessment design within initial 
teacher training. For this reason, it often happens that teachers rely heavily on the assessment tools provided by 
textbook publishers or some other sources, which do not support students’ learning. This certainly points to the 
need to modernize and adapt curricula of relevant courses within initial teacher education, regarding assessment 
design and analysis, but besides that, it points to the need for additional programs of continuing professional 
development that would encompass training in assessment. 

Finally, some dilemmas remain to be considered: 

(i) Do teachers really have a choice? 
(ii) What will be the consequences for those who opt for good teaching practice instead of good scores 
on the test? 
(iii) What can be done on this matter? 
Either way, although we can have different views on the issue, the ultimate goal should be the same for all 
of us —- students with knowledge beyond memorization and algorithms. 
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